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A lexical modeling methodology was employed to examine how the distribution of phonemic 
patterns in the lexicon constrains lexical equivalence under conditions of reduced phonetic 
distinctiveness experienced by speechreaders. The technique involved (1) selection of a 
phonemically transcribed machine-readable lexical database, (2) definition of transcription rules 
based on measures of phonetic similarity, (3) application of the transcription rules to a lexical 
database and formation of lexical equivalence classes, and (4) computation of three metrics to 
examine the transcribed lexicon. The metric percent words unique demonstrated that the distribution 
of words in the language substantially preserves lexical uniqueness across a wide range in the 
number of potentially available phonemic distinctions. Expected class size demonstrated that if at 
least 12 phonemic equivalence classes were available, any given word would be highly similar to 
only a few other words. Percent information extracted (PIE) [D. Carter, Comput. Speech Lang. 2, 

1-11 (1987)] provided evidence that high-frequency words tend not to reside in the same lexical 
equivalence classes as other high-frequency words. The steepness of the functions obtained for each 
metric shows that small increments in the number of visually perceptible phonemic distinctions can 
result in substantial changes in lexical uniqueness. © 1997 Acoustical Society of America. 
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INTRODUCTION 

Spoken word recognition depends on the process of se- 
lecting a word candidate from a set of word patterns stored in 
memory. Current models of auditory spoken word recogni- 
tion agree that the speed and ease of this selection process is 
partially a function of the lexical properties of the target 
word (Lahiri and Marslen-Wilson, 1991; Luce et al., 1990; 
McClelland and Elman, 1986; Norris, 1994). Spoken word 
recognition is facilitated for frequently used words and for 
words perceptually similar to few other words (i.e., words 
with few neighbors). The effects of these lexical properties 
are particularly important when the phonetic information in 
the speech signal is degraded (Luce et al., 1990). In 
speechreading (lipreading), where spoken word recognition 
occurs on the basis of viewing rather than listening to the 
talker, the optical speech signal typically affords less pho- 
netic distinctiveness 1 than the acoustic speech signal. The 
target word’s frequency and perceptual similarity to other 
words will likely be a substantial contributor to the speed 
and ease of word recognition for the speechreader. In the 
current study, we modeled visual perceptual similarity 
among words as a function of the number of perceptually 
available phonemic distinctions. 

Even under optimal perceptual conditions (e.g., adequate 
lighting, moderate speaking rate, highly visible articulatory 
gestures), phonetic information is inadequate to specify all 
the phonemes in the speechreader’ s language. For example, 
speechreaders may not perceive any distinctions among pro- 
ductions of the consonants /b/, /p/, and /m/ (Scheinberg, 



1988). The degree to which phonemic distinctions are un- 
available is to a certain extent due to the articulatory charac- 
teristics of the talker (Kricos and Lesner, 1982; Lesner, 
1988), the conditions (lighting, viewing angle, distance), the 
place of articulation, and the perceptual abilities of the 
speechreader (Jeffers and Barley, 1971). Current spoken 
word recognition models (Lahiri and Marslen-Wilson, 1991; 
Luce et al., 1990; McClelland and Elman, 1986; Norris, 
1994) imply that loss of phonemic distinctions by itself does 
not predict word intelligibility: because words are hypoth- 
esized to be recognized within the lexical context of percep- 
tually similar words, the distribution of phoneme patterns for 
words in the language is also determinative of intelligibility. 

For example, the English word ‘ ‘bought’ ’ remains lexi- 
cally distinct from all other words after the loss of the dis- 
tinctions between /b/, /p/, and /m/. “Pought” and “mought” 
are not words; therefore, a misperception of “bought” as 
either of these nonwords could nevertheless result in an ac- 
curate identification of the only existing word “bought.” 
Alternatively, under the same conditions, “bat” would not 
have a similar advantage versus “pat” and “mat.” 

Nitchie (1916) and Berger (1972) attempted to estimate 
the loss of perceptual uniqueness (i.e., homopheny) of En- 
glish words experienced by speechreaders. According to 
Nitchie, approximately 50% of words in colloquial English 
speech are perceptually unique for the average speechreader. 
Berger estimated that, taking frequency of use into account, 
between 51% and 55% of the common words in English 
remain unique. Although the authors never fully specified 
their methods, it is clear that they used a fixed estimate of the 
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number of available phonemic distinctions, which did not 
take into account the occurrence of variation in visual pho- 
netic distinctiveness due to a range of factors (some of which 
were mentioned earlier). Furthermore, their analyses focused 
solely on percentage of unique words in the lexicon. Because 
the number of words perceptually similar to the target word 
influences its speed and ease of auditory spoken word recog- 
nition (Luce et al., 1990), the number of words a 
speechreader must discriminate among during recognition 
must be taken into account in order to obtain informative 
estimates. A computational modeling study of lexical 
uniqueness in English was undertaken in which the number 
of available phoneme distinctions was systematically varied 
to model a range of visual-phonetic information. Several dif- 
ferent quantitative measures were also examined in order to 
understand better how the distribution of word phoneme pat- 
terns in English is affected by the number of available pho- 
nemic distinctions. 

I. LEXICAL MODELING 
A. Methods 

The methods employed for the current study were origi- 
nally developed in automatic speech recognition to assess the 
feasibility of using broad phonetic transcription to select a 
small subset of lexical candidates from a large lexical data- 
base (Huttenlocher and Zue, 1984; Carter, 1987) and have 
also been applied to the study of human spoken word recog- 
nition (see Altmann, 1990; Altmann and Carter, 1989; Pisoni 
et al, 1985). 

The methodology was applied as follows: First, a pho- 
nemically transcribed machine-readable lexical database was 
selected to serve as a representative sample of the words in 
the language. Along with a phonemic transcription, each 
word in the database had an estimate of its frequency of 
occurrence in the language. Second, transcription rules were 
defined on the basis of measures of phonetic similarity. The 
transcription rules were in the form of single-symbol substi- 
tutions for all phonemes in phonemic equivalence classes. A 
phonemic equivalence class comprised the set of phonemes 
rendered equivalent by the loss of phonetic distinctiveness. 
(For example, if /b/, /p/, and /m/ belong to a single phonemic 
equivalence class, then a rule was defined to transcribe each 
occurrence of /b/, /p/, and /m / into a symbol representing the 
equivalence class.) Third, the lexical database was then tran- 
scribed according to the rules. Lexical equivalence classes 
were formed by collapsing across identically transcribed 
words. (For example, under the phoneme equivalence class 
definition given above, “pat” and “bat” would both belong 
to the same lexical equivalence class.) Lastly, metrics were 
computed to compare the distribution of patterns in the 
newly transcribed lexicon with the distribution of patterns in 
the original lexicon. 

1. Lexical database 

The method described above was applied to the PhLex 
database (Seitz et al., 1995). PhLex’s entries include the 



19 052 most frequent words in the Brown corpus (Kucera 
and Francis, 1967), the 19 750 words listed in the Hoosier 
Mental Lexicon (Nusbaum et al., 1984), and 1173 words ex- 
tracted from stimulus and response sets from our laboratory. 
In total, after accounting for overlapping entries in the source 
lists, the PhLex database contains 32 377 unique ortho- 
graphic entries. In addition to orthographic transcriptions, all 
of PhLex’s entries have baseform phonemic transcriptions 2 
that include stress and syllabification symbols, estimates of 
frequency of usage (Kucera and Francis, 1967), and subjec- 
tive familiarity ratings (Nusbaum et al, 1984). When an es- 
timate of a word’s frequency of occurrence was not avail- 
able, it was assumed to be equal to 1. All frequencies were 
log-transformed to the base 10. 

2. Transcription rules 

Sets of transcription rules were developed using previ- 
ously published perceptual data on speechreading (Eberhardt 
et al., 1990; Montgomery and Jackson, 1983). Consonant 
transcription rules were based on visual consonant identifi- 
cation data collected in a Consonant-/a/ environment (Eber- 
hardt et al., 1990), with overall percent correct of approxi- 
mately 34.6%. Vowel transcription rules were based on 
visual vowel identification data collected in the environment 
/h/-Vowel-/g/ (Montgomery and Jackson, 1983), with overall 
percent correct of 54.2%. Estimates of visual phonetic simi- 
larity were obtained from multidimensional scaling solutions 
of the obtained consonant and vowel confusion matrices 
(Bernstein et al., 1994). The analyses employed 40 pho- 
nemes, 17 vowels and 23 consonants. The similarity esti- 
mates were submitted to separate hierarchical cluster analy- 
ses using the average linkage between groups method 
(Aldenderfer and Blashfield, 1984; SPSS, 1990). Hierarchi- 
cal cluster analysis was used to algorithmically generate 
nested sets of phonemic equivalence classes by incremen- 
tally joining phonemes based on their estimated similarity. 

Sets of transcription rules (see Table I) were generated 
to model a range of visual phonetic distinctiveness due to a 
range of viewing conditions, talker characteristics, and 
speechreaders’ abilities. Transcription rule sets were gener- 
ated by hierarchically varying the total number of phonemic 
equivalence classes between 1 and 40. Thus, when there was 
only one equivalence class, all the consonants and vowels 
were transcribed as a single symbol. When there were 40 
equivalence classes, each one contained a unique phoneme 
of English. Because perceptual data were not available for 
/g j q/, /g / was assumed to be most similar to /a/, /j/ was 
assumed most similar to /a/, and /[)/ was assumed most simi- 
lar to /g/. Vowels and consonants were assumed to be maxi- 
mally dissimilar, except for the consonant /j/ which was in- 
cluded in the vowel confusion matrix. Thus, except for the 
one equivalence class that contained all the phonemes and 
the ones that contained /j/, no phonemic equivalence class 
contained both consonants and vowels. 

Table I lists the sets of phonemic equivalence classes 
employed in the current study. The table shows that the num- 
ber of vowel and consonant equivalence classes was allowed 
to increase at the same rate. One heuristic for determining 
the appropriate number of clusters to represent a perceptual 
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TABLE I. Phonemic equivalence classes employed in the development of transcription rule sets. All phonemes 
within a set of brackets were considered perceptually equivalent. 



Number of phonemic 
equivalence classes 


Phonemic equivalence classes 


28 


{u} {u} {sr} {o} {au} {i,i} {e,s} {<e} { 01 } {d} {ai} {3,a,A,j} 
{b.p.m} {f,v} {1} {n} {k} {g.g} {h} {d} {t} {s,z} 
{w,r}{6,0}ffi{t,J}{ 3 }{d 3 } 


19 


{u,u,or} {o,au} {i,i} {e,s} {as} {oi} {ai,9,a,Aj,i} {b,p,m| 

{f,v} {1} {n,k} {g.g} {h} {d} {t,s,z} {w.r} {6,0} {f.tf, 3 ,d 3 } 


12 


{u,u,sr} {o.au} {i,i,e,e,se} { 01 } {o,ai,3,a,A,j} {b.p.m} {f.v} 
{l,n,k,g,g.h} {d,t,s,z} {w.r} {6,0} {f.tj, 3 ,d 3 } 


10 


{u.u.sr} {o.au} {i,i,e,e,as} {oi,o,ai.3,a,A,j} 

{b.p.m} {f,v} {l,n,k,g,g,h,d,t,s,z} {w,r} {6,0} {f,t(, 3 ,d 3 } 


2 


{u,u,3r,o,au,i,i,e,E,ffi,3i.D,ai,3,a,A,j} 

{b,p,m,f,v,l,n,k,]j,g,h,d,t,s,z,w,r,6,0,j,tj, 3 ,d 3 } 


1 


{u,u,3r,o,au,i,i.e,E,ffi,3i.D,ai,3,a,A,j.b,p,m,f,v,l,n,k,n,q.h,d,t,s, 

z,w,r,6,0,j,tf, 3 ,d 3 } 



data set is to look for large increases in the distance coeffi- 
cients over which clusters are being formed at each stage of 
the analysis. The logic behind this strategy is that a large 
increase in the distance coefficient reflects the joining of two 
relatively dissimilar clusters, and therefore the appropriate 
cluster solution includes all the clusters formed up to, but not 
including, that stage (Aldenderfer and Blashfield, 1984). Ex- 
amination of the distance coefficients obtained in the current 
cluster analyses suggested that 12 phonemic equivalence 
classes best approximated the phonemic distinctions per- 
ceived by the speechreaders whose data generated the con- 
fusion matrices. Furthermore, these 12 phonemic equiva- 
lence classes correspond to previous estimates of visually 
available phonemic distinctions (see Jackson, 1988). In ad- 
dition, 28, 19, and 10 phonemic equivalence classes were 
selected to represent f, and j of total number of vowels 
and consonants. 



3. Application of transcription rules 

Following transcription, lexical equivalence classes 
were formed by collapsing across identically transcribed 
words. Table II displays examples of the application of rules 
to the words “tan” and “cat.” The first column in Table II 
gives the number of equivalence classes in the transcription 
set from which each set of relevant rules for the example is 
taken. (The complete sets are in Table I.) The third and 
fourth columns list the transcriptions of “tan” and “cat,” 
after the application of the transcription rules. The two words 
enter into the same lexical equivalence class for all rule sets 
with ten or fewer phonemic equivalence classes. 

The distribution of perceptually similar word patterns 
could be influenced by lexicon size or method of subsam- 
pling the lexicon. Specifically, common and rare words dif- 
fer in phonemic composition, and common words tend to be 



TABLE II. Transcription examples for “tan” and “cat.” All phonemes within a set of brackets were rewritten 
into a single symbol represented here by upper-case letters. 



Number of phonemic 
equivalence classes in 
transcription rule set 


Relevant transcription rules 


Transcriptions 

of 

“tan” “cat” 


19 


{ae}=>A 

{t,s,z}=>T 

{n,k}=>N 


TAN 


NAT 


12 


{i,i,e,£,a 2 }=>A 

{d,t,s,z}=>D 

{l,n,k,g,g,h}=>N 


DAN 


NAD 


10 


{i,i,e,£,a 2 }=>A 

{l,n,k,g,g,h,d,t,s,z}=>D 


DAD 


DAD 


2 


{u,u,9r,o,au,i,i,e,£,ae,oi,3,ai,3,a,A,j}=>V 

{b,p,m.f.v.l,n,k,g,g,h,d,t,s,z,w,r,6,0,J.tf, 3 ,d 3 }=>C 


CVC 


CVC 


1 


{u,u,3r,o,au,i,i,e,E,as,oi,o,ai,3,a,A,j,b,p,m,f,v,l,n, 

k,g,g,h,d,t,s,z,w,r,6,0.J,tf, 3 ,d 3 }=>P 


PPP 


PPP 
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more like other common words (Landauer and Streeter, 
1973). Thus, transcriptions and metrics were applied to three 
partitionings of the lexical database. One corresponded to the 
1052 most frequent words in the Brown Corpus (Kucera and 
Francis, 1967), henceforth, “IK;” the second to the 4995 
most frequent words, “5K;” and the third to the entire 
31 081 word lexicon, “30K.” 3 Several words at the 1000- 
word and 5000-word cutoff points had identical frequencies 
of occurrence. 

The presence or absence of stress and syllabification in 
the transcriptions was manipulated to explore effects of su- 
prasegmental information. The PhLex database contains two 
levels of marked stress and a third unmarked level. When 
stress and syllabification were taken into account, two words 
were considered equivalent only when their phonemic, 
stress, and syllabification patterns were identical. For ex- 
ample, the noun ‘ ‘convert’ ’ and the verb ‘ ‘convert’ ’ were not 
equivalent when stress was taken into account in the analy- 
sis. 




Number of Phonemic 
Equivalence Classes 

FIG. 1. The log-frequency- weighted percentage of words unique for 1-40 
phonemic equivalence classes is plotted for 30 K-, 5 K-, and 1 K-word 
partitions of the lexicon. The unweighted percentage of words unique for 
1-40 phonemic equivalence classes is plotted for the 30 K partition of the 
lexicon. 



4. Quantitative analyses 

Following transcription, three commonly employed met- 
rics were computed on the results (see Altmann, 1990). Each 
metric examined a different aspect of lexical equivalence. 
The first metric, frequency-weighted percent words unique, 
estimated the proportion of unique words in the transcribed 
partition of the lexicon. The second metric, frequency- 
weighted expected class size, estimated the average number 
of words in lexical equivalence classes in the transcribed 
partition of the lexicon. The third metric, percent informa- 
tion extracted, or PIE (Carter, 1987), estimated information 
remaining in the lexicon. The PIE values are a function of 
the size of resulting lexical equivalence classes and the dis- 
tribution of frequencies for words within lexical equivalence 
classes. 

Frequency-weighted percent words unique was com- 
puted as 

2"" 

%wu=^ r i — xioo, (i) 

2 L ,Fj 

l = 1 1 

where n v was the total number of unique entries after tran- 
scription, F u was the frequency of occurrence for unique 
words in the transcribed lexical partition, n L was the number 
of words in the lexicon, and F i was the frequency of occur- 
rence of words in the original lexical partition. The 
frequency-weighted metric was intended as an estimate of 
the extent to which unique words would be encountered in 
everyday language by a deaf speechreader. 

Frequency-weighted expected class size was computed 
as 

1 n E 

ECS=— E I a F a , (2) 

r L a= 1 

where F L was the sum of frequencies of occurrence of all 
words in the lexical partition, n E was the total number of 
lexical equivalence classes, I a was the number of words in 
equivalence class a, and F a was the sum of frequencies of 
occurrence of the words in equivalence class a. This metric 



was intended as an estimate of the average size of the 
frequency-weighted equivalence classes that would typically 
be encountered in everyday language. 

The PIE (Carter, 1987) was computed as 



PIE= 



K=\P» l °gPa 

— X 100. 

^itiPi lo E Pi 



(3) 



where n E was the total number of equivalence classes after 
transcription, p a was the sum of the probabilities of occur- 
rence for words in equivalence class a, n L was the number 
of words in the lexicon, /?, was the probability of occurrence 
of word i, and log was taken to the base 2. The probability of 
a word’s occurrence was computed by dividing its raw fre- 
quency of occurrence by the total number of occurrences in 
the Brown corpus (Kucera and Francis, 1967). 

The PIE is an information theoretic metric, developed by 
Carter (1987), of the amount of information extracted from 
the lexicon when the number of available phonemic distinc- 
tions is reduced. In essence, PIE is the number of binary 
units (bits) required to code the lexicon after transcription 
divided by the number of bits required to code the original 
lexicon multiplied by 100. For example, an original lexicon, 
containing four equally frequent words, could be coded by 
two bits. One bit dividing the entire lexicon in half and a 
second bit dividing each half of the lexicon in half again. A 
transcription of that lexicon resulting in two equally frequent 
lexical equivalence classes could be coded by a single bit. 
The resulting PIE would be equal to 50%. In contrast to 
expected class size, PIE is sensitive to the distribution of 
frequencies within lexical equivalence classes. Specifically, 
PIE is high when all the equivalence classes are roughly 
equal in frequency which is better approximated when fre- 
quent words are in different equivalence classes (see Carter, 
1987, for a detailed discussion of PIE). 



B. Results and discussion 

The results of the analyses for each log-frequency- 
weighted metric are plotted in Figs. 1-3 as a function of the 
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Equivalence Classes 

FIG. 2. The log-frequency-weighted expected class size for 1-40 phonemic 
equivalence classes is plotted for 30 K-, 5 K-, and 1 K-word partitions of the 
lexicon. The unweighted expected class size for 1 -40 phonemic equivalence 
classes is plotted for the 30 K partition of the lexicon. 

number of phonemic equivalence classes in a transcription 
rule set. Because results were unaffected by the presence 
versus absence of stress and syllabification, in Figs. 1-3 only 
the results for analyses with stress and syllabification were 
plotted. In addition, the unweighted statistics are also plotted 
in Figs. 1 and 2 for the 30K partition of the lexicon. Only 
small differences were observed between the unweighted and 
the log-frequency-weighted data for the 5K and IK parti- 
tions, thus the unweighted 5K and IK data were not plotted 
in Figs. 1 and 2. Examination of raw frequency-scaled analy- 
ses (i.e., frequencies were not log-transformed), in which all 
lexical partitions were influenced equivalently by the scaling, 
suggested that the small effect of log-frequency scaling on 
the 5K and IK partitions was related to a trivial interaction 
of the log scaling and the distribution of frequencies in these 
lexical partitions. 

In Figs. 1-3, 12 phonemic equivalence classes repre- 
sented the best fit by the cluster analyses for both the conso- 
nant and vowel data. Beyond 12 equivalence classes, the 
distance coefficients in the cluster analyses were very small. 
This means that additional equivalence classes that were 
formed as a result of hierarchically separating larger ones 




FIG. 3. The percent information extracted for 1-40 phonemic equivalence 
classes is plotted for 30 K-, 5 K-, and 1 K-word partitions of the lexicon. 



into smaller ones represented perceptual distinctions not re- 
liably made by the subjects who viewed the syllables. Thus, 
if the abscissas of the figures were translated into perceptual 
distance, they would be highly compressed as the number of 
equivalence classes increases beyond 12. 

Frequency-weighted percent words unique (Fig. 1) in- 
creased as the number of visible phonemic equivalence 
classes increased. Only a small effect of lexical partition size 
was obtained. These results demonstrate that the distribution 
of phoneme patterns constituting words in English substan- 
tially preserves visual lexical uniqueness, even when the 
transcription employs only 12 phonemic equivalence classes 
(i.e., far fewer than half the number of phonemes in the 
language). This number of equivalence classes corresponds 
to 54%-63% words unique across partitions. Berger’s 
(1972) frequency-weighted lexical uniqueness estimate of 
51% falls below our estimates. 

Frequency-weighted expected class size (Fig. 2) de- 
creased as the number of phonemic equivalence classes in- 
creased. Expected class size increased slightly as the size of 
the lexical partition increased. For 12 phonemic equivalence 
classes, frequency-weighted expected class size was 5.1 
words for the 30K lexicon. On average, any given word is 
predicted to be equivalent with only a few other words, when 
at least 12 equivalence classes are available. 

The PIE (Fig. 3) increased as the number of phonemic 
equivalence classes increased. However, the largest change 
in PIE was between 0 and 12 equivalence classes: above 12 
equivalence classes, most of the information has been ex- 
tracted from the lexicon. For 12 phonemic equivalence 
classes, 92% of the information was extracted. The PIE val- 
ues were essentially identical, independent of lexical parti- 
tion size. 

The high PIE values obtained even with few equivalence 
classes provide evidence that high-frequency words tend to 
reside in equivalence classes with lower frequency words. 
An implication of this finding is that a speechreader could 
optimize word recognition accuracy by selecting the most 
frequent word in a lexical equivalence class. The current 
results are somewhat surprising given the results Landauer 
and Streeter (1973) (Pisoni et al., 1985; but see Bard and 
Shillcock, 1993) have reported indicating that common 
words tend to be more like other common words. However, 
the present study differed from previous investigations not 
only in terms of the nature of similarity among words (visual 
versus auditory), but also in terms of the operational defini- 
tion of similarity among words, the lexical entries employed 
in the analyses, and the metrics employed to assess the simi- 
larity of words. Therefore, further analyses are required to 
assess whether the current results are idiosyncratic to 
speechreading or have general implications for spoken word 
recognition. 

II. GENERAL DISCUSSION 
A. Subject characteristics 

The present results demonstrate that lexical uniqueness 
varies substantially with the number of available phonemic 
distinctions. Thus, the level of performance in the nonsense 
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syllable confusion matrices used to estimate phonemic 
equivalence must be considered in interpreting results of the 
type presented here. 

Berger’s (1972) previous estimate of 51% words unique 
for the average speechreader is below our estimate of 54% - 
63% for 12 phonemic equivalence classes obtained on the 
basis of data from relatively inaccurate hearing speechread- 
ers. The consonant confusion data employed here were ob- 
tained from matrices with 34.6% correct responses. These 
subjects also speechread CID Everyday Sentences (Davis 
and Silverman, 1970) and had mean scores of 24.9% key- 
words correct at pretest and 35.3% at posttest. Although 
these scores are typical in the speechreading literature for 
hearing adult subjects, this performance level is low in com- 
parison with that observed for expert deaf speechreaders. 
Bernstein et al. (1996) reported that expert deaf speechread- 
ers (in the upper quartile of subjects) scored between 61% 
and 79% words correct on the same sentence materials. 
These expert deaf speechreaders have achieved 41% correct 
(with a range from 37%-47% correct) with the same CV 
materials (Eberhardt et al., 1990; Bernstein et al., 1996). The 
consonant confusion data employed in the current study is 
comparable to that used by Berger (1972). However, the cur- 
rent study may represent a conservative estimate of 
phonemic/lexical uniqueness in the skilled speechreader. We 
are currently collecting consonant and vowel confusion data 
from skilled deaf speechreaders to investigate this hypoth- 
esis. 

B. Lexical characteristics 

The high obtained PIE values provide evidence that 
high-frequency words do not reside in the same lexical 
equivalence classes as other high-frequency words (Carter, 
1987). As suggested earlier, a speechreader might optimize 
word recognition accuracy by selecting the most frequent 
word in a lexical equivalence class. Current models of spo- 
ken word recognition predict a bias toward selection of high- 
frequency words in the absence of other discriminating in- 
formation. We predict that word recognition in 
speechreaders employs the same biasing processes as those 
for auditory spoken word recognition, a hypothesis that 
awaits empirical testing. 

C. Modeling assumptions 

Two simplifying assumptions employed in our analyses 
are relevant when considering the relationship of our model- 
ing results to human performance. First, vowels and conso- 
nants were operationally defined to be equally distinct and to 
change their distinctiveness at equal rates. The decision to 
increase the number of vowel equivalence classes at the 
same rate as the consonant equivalence classes was a prag- 
matic one. In the literature, phoneme identification studies 
have not been conducted in such a way as to provide infor- 
mation on how visibility of vowels and consonants covaries 
for individual speechreaders. Such data are needed for accu- 
rately modeling changes in visibility across the entire phone- 
mic inventory of a language. 



Second, phonetic similarity was estimated based on 
identification of phonemes in monosyllabic nonsense syl- 
lables. Coarticulation effects arising from variation in sur- 
rounding phonetic contexts have been demonstrated to alter 
phoneme identification by speechreaders (Benguerel and 
Pichora-Fuller, 1982; Jackson, 1988). The use of phoneme 
identifications with monosyllabic nonsense syllables could 
lead to either an under-or overestimation of the phonetic in- 
formation available to the speechreader. Although coarticu- 
lation is frequently thought to reduce phonetic information. 
Church (1987) has argued (for acoustic speech) that it is 
informative for identifying words. We are currently collect- 
ing consonant and vowel identification data within a wide 
variety of disyllabic nonsense word contexts to further refine 
our estimates of the visually available perceptual informa- 
tion. Preliminary analyses suggest that the distribution of 
words in the language substantially preserves uniqueness 
even when variability due to phonetic context is taken into 
account (Auer et al., 1997). 
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*We use phonetic or visual phonetic distinctiveness to refer to the discrim- 
inability of visual speech stimuli and phonemic distinctions to refer to the 
phonological differences that distinguish words from each other. 
2 Baseform transcriptions are basic phonemic representations that may in- 
volve the undoing of phonological rules. For example, the baseform tran- 
scription of the word "writer” would be /'raitor/, which would involve the 
undoing of an alveolar tapping rule from the pronunciation /‘rairar/. 

Hue to homophony, the total number of entries in the analysis, 31 081, was 
reduced from the total number of orthographic entries in the lexicon, 
32 377. 
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